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We propose a new global jet-finding algorithm for reconstructing two-prong objects like hadronic 
weak gauge bosons at a hadron collider. The selection of particles in a two-prong jet is required 
to maximize a function, which contains a modified second Fox-Wolfram moment and prefers a 
two-prong structure for a fixed jet mass. Compared to the traditional jet-substrncture method, our 
algorithm can provide a similar or better performance for identifying boosted weak gauge bosons 
that are produced from either Standard Model processes or heavy di-boson resonance decays. 


Introduction. Jet substructure techniques have be¬ 
come essential tools in the search for new physics at 
a hadron collider such as the Larger Hadron Collider 
(LHC). Strategies have been proposed to reconstruct 
hadronically decaying objects such as the W and Z gauge 
bosons, the Higgs boson and the top quark, and ex¬ 
tract from them vital information that was perviously 
unattainable. Most existing jet substructure methods are 
separated into two steps: first, jets are formed with stan¬ 
dard jet-finding algorithms [DE]. The jet size must be 
large enough to include most of the decay products of 
the object. Therefore the jets are “fat”. Fat jets are 
then processed with a grooming algorithm [3H7] or other 
methods to remove soft particles that are unlikely from 
the object decay. An alternative, relatively unexplored 
approach is to develop a new jet-clustering algorithm to 
directly reconstruct the multi-prong objects, utilizing its 
structure and properties. ^ The algorithm could be more 
specialized to suit different objects, and, if carefully de¬ 
signed, should be more efficient to reconstruct the phys¬ 
ical object than ordinary jet-finding algorithms because 
it contains more physics information. 

To reconstruct a single-prong jet, one can choose a 
set of particles to maximize a jet function that prefers 
a large jet momentum but a small jet mass 
The relevant jet function for one-prong jet only depends 
on the summed four-momenta of particles belonging to 
the jet. Following the same philosophy, in this paper we 
develop a new jet-finding algorithm to reconstruct two- 
prong jets. Since the two-prong structure can be inter¬ 
preted as a jet shape, the immediate question is how to 
add an additional component in the jet function to prefer 
a two-prong structure. There is a large amount of liter¬ 
ature on event shapes at a lepton collider including the 
Fox-Wolfram (FW) moments [TBIIT7] . We will show that 


^ The W-jettiness/XCone IS UlOl method can be viewed as such an 
attempt, although in which a fixed number of jets in an event 
are constructed. 

^ See Ref. 1151 for its connection to other jet finding algorithms. 


with some modifications, we can incorporate the second 
Fox-Wolfram moment into the jet function, maximizing 
which prefers a two-prong structure for a jet. 

Our new jet-finding algorithm is a global one because 
all subsets of particles will be considered to maximize the 
jet function. It has advantages of including only relevant 
particles for the physical object, and being infrared and 
collinear safe based on the argument in Ref. m and 
the discussion on FW moments in Refs. m HI]. On 
the other hand, it also has its disadvantage because 2^ 
subsets need to be checked, where N is the number of 
particles in an event, and an unrealistic computing power 
is required for just dozen of particles. To reconcile this 
problem, we provide a double-anti-A:t-cone algorithm to 
find the approximate maximum of the jet function. As we 
will demonstrate, this approximation matches very well 
the true global maximum. As a result, the double-anti- 
fej-cone algorithm has a realistic number of operations of 
where ur is the number of different anti-/c( jet 

sizes. 

Two-prong Jet Function. Similarly to the studies in 
Refs. prifTS] for a one-prong jet, we maximize the follow¬ 
ing two-prong jet function to select the subset of particles 
in an event as a jet 

— f3 + 7 (lepton colliders), (1) 

— j3 H 2 (hadron colliders). (2) 


Here, E and M are the jet energy in the lab frame and jet 
mass for the subset of (massless) particles. For hadron 
colliders, we use the transverse energy defined as = 
E"^ — P^. In addition to the one-prong jet function m, 
we have added an additional component, 


H2 = 


[S 


iPi -Pjf 

{Pj ■ Pi) {Pj ■ pj) 



(3) 


with Pj = J2iPi the total four-momentum of the subset 
and Pi - Pj =ptPjt,- 

A simpler way to understand this new function is to 
boost all particles to the rest frame of Pj, in which one 
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has 


H. = 


EiEj cos^ 9ij 


rest frame 


( 4 ) 


Here, d^j is the angle between two particles in the rest 
frame of Pj. For a fixed invariant mass J2i El^^ frame _ 
M, one can easily show that H 2 is maximized when 
there are only two back-to-back massless prongs. Our 
function is related to the FW moments MM. Hi = 
\A\ \Pj\PiicosOij)/E‘^, by H 2 = E‘^{2H2 + l)/3. It 
is also interesting to note that the zeroth and first FW 
moments are equivalent to the jet energy and mass, re¬ 
spectively. Therefore, our can be expressed just using 
Ho,1,2- 

For the one-prong jet function in Refs. [TIHT3] , it was 
shown that the jet parameter /3 controls the size of the 
jet with the jet cone radius scaling as 1/v^ for a large 
p. Our two-prong jet functions in Eqs. 0 and 0 have 
two parameters, /3 and 7 . Similar to the one-prong 
case, (3 should be positive to avoid clustering all par¬ 
ticles in an event into a single jet. Furthermore, we 
also require /? > 7, otherwise two back-to-back groups 
of particles are preferred in a single jet. This can be eas¬ 
ily seen for two massless energetic particles, which have 
= E"^ — {[3 — 7 ) and {/3 — 7 ) as the effective (3 for 
the one-prong jet function. Therefore, we anticipate that 
the parameter l/V/3 — 7 for a large (/3 — 7 ) roughly con¬ 
trols the geometric size of a fat two-prong jet. At hadron 
colliders, in order to exclude particles in the beam direc¬ 
tion, we also require /? — 7 > 1. 

To further understand the shape of an object ob¬ 
tained by maximizing the two-prong jet function, we add 
an infinitely soft particle into the system of two ener¬ 
getic massless prongs to determine the passive catchment 
area m- Since the generic feature at a hadron collider 
is similar to the one at a lepton collider, we study the 
jet function at a lepton collider for simplicity. Defining 
the weighted angular distances of the soft particle with 
respect to the hard particles 


2 F;i (1 - cos 6 »i) 
1 - El+E2 


_ 2^2 ( 1 - cos 6*2 


with 9i and 02 as the angles between the soft particle and 
the two hard particles, we derive the following condition 
for the soft particle to increase the value of 


/3(di + ^ 2 )^ ~ 'y{di ~ ^ 2 )^ < 2(fii -|- 9,2) ■ 


( 6 ) 


Choosing the two hard particles with Ei = 3 E 2 located 
at (7r/2, ± 7 r/ 12 ), we show the boundary of soft particles 
in Fig. for different values of (3 and 7 . From Eq. ^ 
and Fig. one can see that for a large value of (3 the 
soft particles are close to one of the two hard particles 
and the allowed region consists of two separated cones. 
As f3 decreases, the two cones expand and overlap with 



FIG. 1: The boundaries of soft particles surrounding two hard 
particles with Ei — SE 2 on the 9 — (f) plane for different values 
of (3 and 7 . The locations of the two hard particles are marked 
with the black dots. 


each other. For a small value of /3, the “fat jet” has a 
single cone with two subjets inside, which is what the 
traditional jet-substructure method searches for. 
Double-anti-kT-Cone Algorithm. It is unrealistic to 
exhaust all 2 ^ possible subsets of particles to maximize 
our jet function, even for several dozen of particles in an 
event. In practice, however, one can find approximate 
and fast ways to identify a subset of particles that pro¬ 
vide a jet function value close to the true maximum. We 
provide one such algorithm built on top of the anti-fct 
algorithm [Si- 

Given the observation in Fig. that the jet shape 
can be described roughly by two separate or overlapping 
cones, one could consider all possible pairs of cones (and 
individual ones), and identify the one with the largest 
as the jet candidate. Specifically, our jet-finding al¬ 
gorithm is described as following 

1. Preparing candidate cones: for a given event, col¬ 
lect anti-A:* (or other fast cone-shape-jet-finding al¬ 
gorithm) jets with various jet radii. For instance, 
one can scan the jet radii in nn ~ 0 ( 10 ) steps 
with a maximum value of i?max ~ 0(1). In the fol¬ 
lowing simulation, we will choose i?max = 0.7 and 
Amin — Rmax/fr-i? — 0.05. 

2. Maximizing for each pair of (and individual) 
anti-fct jets, calculate based on the particles 
belong to this pair of (or individual) anti-fct jets. 
Particles shared by two different jets are counted 
only once. Find the pair (or the individual one) 
that maximizes as the leading jet. 

3. Iterating: remove particles that belong to the lead¬ 
ing jet, and repeat the procedure to find the next 
jet, until all particles in an event are exhausted. 
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The running time of this algorithm for N particles scales 
as N'^n\. ^ We test the goodness of the above approxi¬ 
mate algorithm against the true maximum using a sample 
of hadronic WW events generated with a > 200 GeV 
cut. Since the true maximum requires us to exhaust all 
2 ^ possible subsets of particles, we only keep the leading 
20 particles in our test. We have found that over 99% of 
the events, the double-anti-fer-cone algorithm finds ex¬ 
actly the same set of particles for the leading jet as the 
exhaustive subset method. For the remaining less than 
one percent of events, the particle contents only differ 
slightly with the difference of the jet pr’s less than 10 %. 
Results for W-jet Tagging. Using boosted VF-bosons 
as an example, we compare the results from our jet¬ 
finding algorithm with other traditional jet-substructure 
methods including the jet grooming methods [MS]- For 
the signal, we consider the Standard Model WW pair 
production at the 14 TeV LHC with both W’s decay 
hadronically. The dominant background is QCD dijets. 
Both signal and background events are simulated with 
PYTHIAS 1 ^. To mimic the pileup effects, we also add 
20 soft QCD events for each hard event. 




FIG. 2: Left panel: jet mass distributions for the leading two 
jets. The jet algorithm parameters for obtaining the largest 
signal efficiency can be found in Table for > 300 GeV. 
Right panel: the receiver operating characteristic curves for 
the signal and background efficiencies by varying the cut on 
the V-subjettiness ratio variable T 2 /T 1 [22j . 

Imposing a parton-level px > 300 GeV cut on the 
FF’s and QCD jets, we show the reconstructed jet mass 
distributions in the left panel of Fig. [^ comparing our 
jet-finding algorithm and the traditional jet-substructure 
analysis with a fat jet plus filtering or pruning. We have 
chosen to maximize the signal acceptance for jet mass 
in the (60,100) GeV window, and the optimum jet pa¬ 
rameters are shown in Table |l| From the left-panel of 
Fig. HI one can see that our jet-finding algorithm identi¬ 
fies more FF-jets in the mass window. For events within 
the (60,100) GeV mass window, we further calculate the 
iV-subjettiness ratio, T 2 /T 1 |52|. We apply a varying cut 
on T 2 /ri to produce the receiver operating characteristic 


® Our numerical code can be found in https://github.coni/ 
LHCJet/JETII, which is based on Fast Jet m- 


(ROC) curves, shown in the right panel of Fig. Com¬ 
paring to filtering and pruning, our jet algorithm provides 
a comparable or larger signal acceptance for most of the 
background efficiencies. The ROC curve of pruning is 
very close to the one when the signal efficiency is 
below 60%. 


W 

Pt > 

Filtering 

Pruning 

JEt 

R{C/A) 

-^filter 

i?(anti-A:t) 

^cut 

P 

7 

200 GeV 

1.6 

0.35 

1.0 

0.075 

6 

5 

300 GeV 

1.4 

0.30 

0.8 

0.05 

8.5 

7 

400 GeV 

1.0 

0.25 

0.8 

0.05 

11 

9.5 

800 GeV 

0.6 

0.15 

0.6 

0.025 

12 
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TABLE I: The optimized jet-finding parameters that provide 
the maximal signal efficiency for different px cuts in the jet 
mass window of (60,100) GeV. For pruning, we start from 
an anti-fct fat jet and prune it using the Gambridge/Aachen 
(G/A) algorithm. The additional parameter D^ut is fixed 
to be Dcut = rnjjpTj- A smaller value of Dcut provides 
marginally larger signal efficiencies, but with the signal peak 
shifted to a lower value. 

To test our jet-finding algorithm for FF’s with dif¬ 
ferent pt’s, we scan the jet parameters to obtain the 
maximal signal efficiency and show the values of the 
jet parameters in Table |T| For the fat jet plus pruning 
method, we choose to start with anti-fct fat jets, which 
give us slightly better results than starting from Cam¬ 
bridge/Aachen (C/A) [32 HH fat jets, and then prune 
them using the C/A algorithm. For the fat jet plus fil¬ 
tering method, we use the C/A algorithm for both the 
fat jet and the filtering procedure. We have found that 
including only two sub jets provides a larger signal effi¬ 
ciency in the jet mass window of (60,100) GeV, than 
three subjets. For our algorithm, larger values of (3 
and /3 — 7 are preferred for FF’s with a larger pp. 

After fixing the jet parameters for different pp cuts, 
we show the background fake rate in Fig. [^ with a fixed 
signal efficiency of 70%. From Fig. one can see that 
for less boosted FF’s with a small px cut, the filtering 
method better identifies the two isolated jets and does a 
slightly better job in rejecting the background. For very 
boosted VF’s, our and the pruning methods provide 
comparable results, which are better than the filtering 
method because they do not force two separate prongs as 
the filtering does. For the intermediate pp cuts, our 
algorithm has smaller background fake rates than the 
other two methods. Therefore, one can use our method 
to more efficiently tag hadronic FF-jets for a wide range 
of px's. 

Results for Heavy Diboson Resonance. To further 
illustrate the performance of our algorithm, we consider a 
beyond-the-Standard-Model-physics example, motivated 
by the recent ATLAS searches [22 for diboson resonances 
using jet-substructure techniques. We consider a heavy 
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FIG. 3: The QCD dijet background fake rate for different 
Pt cuts with a fixed signal efficiency at 70%. The jet mass 
window is chosen to be (60,100) GeV. The pruning case with 
Pt > Pt'’^ ~ 200 GeV is not shown because the maximal 
signal efficiency that it can reach is only around 60%. 


W gauge boson in the model of Ref. [55], produced at 
the 8 TeV LHC with 20 fb“^ integrated luminosity. We 
fix the W mass to be 2 TeV and let it decay to W + Z, 
both of which decay hadronically. 




FIG. 4: Left panel: the signal and background distribntions 
for the leading two jet masses from our and declnstering- 
filtering (DF) procednres. Right panel: the distribntions for 
the invariant masses of two leading jets with the lower(higher) 
jet mass within a 26 GeV mass window centered at the IV(Z) 
mass. 

Following the ATLAS analysis in Ref. |5^ , we also use 
the declustering-filtering (DF) procedure to identify the 
W and Z bosons and to keep the QCD dijet mistag rate 
to be around 1%. Starting from R — 1.2 C/A fat jet with 
Pt > 540 GeV, we impose the same cuts on the mass- 
drop, subjet momentum balance and filtering parameters 
with fif = 1, = 0.2 and Rr = 0.3. For our 

jet-finding algorithm (/3 = 100 and 7 = 92) and to further 


suppress the QCD background, we impose an additional 
cut on the momentum balance to utilize the fact that 
a QCD jet has more radiation around its axis than a 
WjZ jet: in each fat jet, we construct the subleading 
around the leading one and require its transverse 
momentum to be smaller than 0.015 times that of the 
leading one. ® 

After selecting the two leading jets with the 
lower(higher) jet mass within a 26 GeV mass window 
centered at the FF(Z) mass, we require them to have 
\yi - 2/2I < 1-2 and (pTi -PT2)/iPTi +PT2) < 0.15 like in 
the ATLAS analysis. We show the distributions of the 
individual jet mass in the left panel of Fig. and the 
dijet invariant mass distributions in the right panel. The 
left panel of Fig. Wl clearly shows that our algorithm 
is more efficient than the DF method to reduce QCD di¬ 
jet events with a jet mass within the WjZ boson mass 
window. From the right panel and imposing an invariant 
mass window cut of (1.8, 2.2) TeV, our method has 
improved Sj^fB by 15% compared to the DF method. 
We also note here that the QCD dijet invariant mass dis¬ 
tributions from our method has a harder tail and a bump 
around 1.9 TeV. This can be understood from the basic 
properties of our jet function in Eq. ([^, which prefers 
jets with a small jet mass but a large pp- The bump 
location is generated from the basic pp cut on the lead¬ 
ing two jets and the additional cut on the surrounding 
subleading jet pp. 

Discussion and Summary. Our jet-finding algorithm 
has the advantage of identifying a smaller set of parti¬ 
cles relevant for the boosted two-prong objects. On the 
other hand, it cannot capture other properties of boosted 
objects like the number of charged particles, the mo¬ 
mentum balance of subjets or other color information of 
the jet (55] [55]. One should also apply the existing jet- 
substructure variables to our jets to further enhance 
the new physics discovery potential. 

One could simply apply the algorithm to iden¬ 
tify the boosted Higgs boson with h —>■ bb. For the sin¬ 
gle 6 -tagging case, one can first identify a two-prong jet 
and then check the additional 6 -tagging information for 
particles belong to this jet. For the double 6 -tagging 
case, h ^ bb already guarantees the two-prong structure 
and our may not add any additional discrimination 
power. It is interesting to see how to extend our 
algorithm together with the 6 -tagging method. 

Beyond the two-prong jets, one could follow the gen¬ 
eral philosophy in this paper to identify a jet function 
favoring other decay topologies. There is not a single 
Fox-Wolfram moment that favors a three-prong jet like 
a hadronic top. On the other hand, it is still possible 


^ Our goal is not to exactly reproduce ATLAS results, which con¬ 
tain a detector simulation. For instance, we do not impose the 
number-of-track cut in our analysis, which could be more sensi¬ 
tive to detailed pileup information. 


® One could also design a “telescoping” jet cut m by choosing a 
different set of /3 and 7 . 
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to define a jet function using a series of Fox-Wolfram 
moments, or other functions constructed to match the 
special event shape [30] . 

In conclusion, we have introduced a new jet-finding 
algorithm that favors two-prong jets. For the Standard 
Model WW productions with two hadronic IF’s, our 
algorithm provides performance close to or better than 
the results from traditional jet grooming methods like 
filtering and pruning for a wide range of pt’s. For a 
heavy di-boson resonance decaying to two hadronic weak 
gauge bosons, our algorithm can further improve the 
new particle discovery significance. 
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